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Abstract 

Background: Geographic variables play an important role in the study of epidemics. The role of one such variable, 
population density, in the spread of influenza is controversial. Prior studies have tested for such a role using 
arbitrary thresholds for population density above or below which places are hypothesized to have higher or lower 
mortality. The results of such studies are mixed. The objective of this study is to estimate, rather than assume, a 
threshold level of population density that separates low-density regions from high-density regions on the basis of 
population loss during an influenza pandemic. We study the case of the influenza pandemic of 1918-19 in India, 
where over 15 million people died in the short span of less than one year. 

Methods: Using data from six censuses for 199 districts of India (n=1 194), the country with the largest number of 
deaths from the influenza of 1918-19, we use a sample-splitting method embedded within a population growth 
model that explicitly quantifies population loss from the pandemic to estimate a threshold level of population 
density that separates low-density districts from high-density districts. 

Results: The results demonstrate a threshold level of population density of 175 people per square mile. A 
concurrent finding is that districts on the low side of the threshold experienced rates of population loss (3.72%) 
that were lower than districts on the high side of the threshold (4.69%). 

Conclusions: This paper introduces a useful analytic tool to the health geographic literature. It illustrates an application 
of the tool to demonstrate that it can be useful for pandemic awareness and preparedness efforts. Specifically, it 
estimates a level of population density above which policies to socially distance, redistribute or quarantine populations 
are likely to be more effective than they are for areas with population densities that lie below the threshold. 
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Background 

Studying influenza pandemics of the past may offer valu- 
able lessons for preparedness for the next great pan- 
demic [1,2]. In this paper, we analyze the 1918 influenza 
pandemic, during which up to 100 million people world- 
wide and 670,000 people in the U.S.A. are estimated to 
have died [3-6]. India, where over 15 million perished in 
the short span of one year, was the single worst- affected 
country in terms of total mortality [7,8]. Factors affecting 
mortality from the pandemic included a lack of immun- 
ity to the virus, which resulted in the infection of 
higher-than-normal numbers of people, its highly conta- 
gious nature [9], and the prior presence of other 
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infections such as tuberculosis or subsequent develop- 
ment of pneumonia as the result of infection [10,11]. Be- 
cause influenza viruses spread through human contact, 
geography and population density in particular are po- 
tential factors for transmission and, indirectly, human 
mortality. The aim of this paper is to analyze the role of 
population density in the influenza pandemic of 1918. 
Because the statistics on influenza mortality for India 
are deeply flawed, following Davis [8], we estimate popu- 
lation growth trajectories allowing for a break between 
1918 and 1919 to capture population loss from the dis- 
ease, and use a threshold estimation method to test 
whether low population density districts in India 
experienced rates of population loss that were different 
from high density districts. 
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Epidemiology of the influenza pandemic of 1918-19 in India 

The 1918-19 influenza pandemic was one of the worst 
epidemics in history with an estimated global mortality 
between 20 and 100 million [9,12]. The pandemic oc- 
curred in two or three waves [13,14]. The first was a 
mild wave in the spring of 1918, followed by a second 
more severe wave in the following autumn that was re- 
sponsible for the majority of deaths. The third wave was 
sporadic [15]. The influenza pandemic of 1918-19 in 
India shared many characteristics with the pandemic in 
other parts of Asia and the world. In this account, we 
draw heavily on the Report of the Sanitary Commis- 
sioner of India for 1918 [16] and Chandra [17]. The 
virus is believed to have entered India in the early 
months of 1918 through the port of Bombay on the west 
coast. This first wave was relatively mild. The same early 
and mild first wave was observed in other countries in- 
cluding Indonesia [18], England, Scotland, and Wales 
[19], Portugal and Spain [20], Mexico [21], and Peru 
[22], and cities including New York City [23] and Co- 
penhagen [24]. It subsided by August, only to be 
followed by a second and far more virulent wave that 
peaked between September and November 1918 in vari- 
ous parts of India. Indonesia [17], England, Scotland and 
Wales [19], Portugal and Spain [20] and Mexico [21] 
experienced a similar pattern of timing and relative se- 
verity. Populations that were exposed to the virus early 
have been shown to have benefited from acquired im- 
munity against the deadly second wave in Denmark [24], 
Norway [25], and Britain [26,27]. A distinctive character- 
istic of this epidemic was its disproportionate impact on 
victims aged between 15 and 35 years [6,14,15,28-30]. 
This phenomenon was also observed in India [16]. 

Epidemics and urban responses in history 

It is no coincidence that the public health community 
worries about new and virulent infectious diseases [31]. 
As recently as 2009, an episode of pandemic influenza is 
estimated to have claimed between 151,700 and 575,500 
lives worldwide [32]. While vaccinations are frequently 
the first line of defense against influenza viruses, devel- 
opment of vaccinations for a new strain of virus may 
take months. In such a situation, only short-term 
measures including social distancing, and in extreme 
cases evacuations and quarantines, can protect citizens 
from a severe epidemic outbreak. The early introduction 
of social distancing measures, such as school and church 
closures and banning of mass gatherings, significantly 
reduced excess mortality during the 1918-19 influenza 
pandemic [33]. With similar contagious diseases, such as 
SARS, plague and cholera, quarantines and evacuations 
have been used to counter severe outbreaks. In the 15 th 
century, for example, the government of Venice 



combated outbreaks of plague by establishing the 
"Lazzaretto Vecchio" on a small island off the coast of 
the Piazza San Marco [34]. More recently, in the late 
19 th and early 20 th centuries, New York City quarantined 
travelers on Swinburne and Hoffman islands to prevent 
the spread of cholera [35]. At about the same time, the 
city established the Metropolitan Board of Health to de- 
velop zoning codes to prevent overcrowding in the city 
and to establish standards for sanitary conditions [36]. 
With reference to the influenza pandemic of 1918-19 in 
India, the Sanitary Commissioner of India wrote: "As the 
striking distance of the influenza virus is probably short 
the obvious ideal is free ventilation and isolation of 
sufferers with a view to increase the air space between 
infected and uninfected" [16, p.66]. 

Over the past two decades, sudden virus outbreaks 
that could have led to widespread human pandemics, 
including H1N1 [37], SARS [38], and H5N1 [39] 
prompted a series of studies on non-pharmaceutical 
interventions [40-48], including measures to increase 
social distance, such as the creation of spatial barriers 
through quarantine [49,50], relocating populations to 
safe' areas [51,52], or imposing travel restrictions [53]. 
Evacuations, a last resort among social distancing 
measures, are still used; in the aftermath of the Haiti 
hurricane of November, 2009, and the subsequent 
earthquake of January 12, 2010, the government 
ordered the evacuation of the capital, Port-au-Prince, to 
prevent the spread of epidemic cholera [54]. In sum, it 
is widely believed that public health interventions, in- 
cluding social distancing measures and the controlled 
movement of people to either sequester those infected 
or as a means to lower population density below some 
critical threshold can significantly decrease the likeli- 
hood of a contagious disease spreading. Yet, to date, 
there is little if any guidance as to what such a popula- 
tion density threshold might be for any disease. Using 
the influenza pandemic of 1918 as a case, this paper 
presents an approach to identifying such a threshold 
value as a guideline for public health policy. 

Population density as a factor in influenza population loss 

Studies examining the potential relationship between po- 
pulation density and mortality during the 1918 influenza 
pandemic have produced mixed results. Garrett [55] 
found a positive relationship between mortality rates and 
population densities measured on a state-wide scale in 
the USA. Once cities were introduced into the equation, 
normalizing the mortality rate of the cities with those of 
the states also showed a positive relationship with popu- 
lation density. For Nigeria, crowding contributed to 
comparatively higher mortality than less-crowded areas, 
and "there is enough evidence to support the view that 
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large towns suffered more than small and remote 
villages" [56]. During the 2009-10 influenza pandemic, 
it was observed that H1N1 infections were sustained 
over longer periods of time in Taiwanese areas with 
higher population densities [57]. Theoretical models for 
influenza and other transmissible respiratory diseases 
consisting of agent, host, and environment interactions 
usually require a high host density [58]. 

Chowell et al. [13] also identified a link between popu- 
lation density and mortality for the 1918 pandemic, 
though it is the opposite of Garrett's [55] findings; in 
Wales and England, low population density in rural 
areas was positively associated with mortality. On a lar- 
ger county scale, however, they found no connection 
between population density or residential crowding and 
mortality or transmissibility. Supporting this stream of 
research, Mills et al. [59] as well as Nishiura and 
Chowell [60] could not identify an association between 
mortality and population size or density, measured as 
household size. While a review of the literature suggests 
that the evidence linking higher population densities 
with higher mortality rates is mixed, intrinsically this re- 
lationship makes sense, because influenza viruses spread 
via human interactions [9]. With rapidly rising popula- 
tion densities around the world, the creation of mega 
cities, and growing international connectivity, there is, 
therefore, a dire need for more research on this 
phenomenon. Given its high population density and 
rapid urbanization, India is of particular interest for the 
study of the emergence and spread of viruses posing sig- 
nificant pandemic threats [61,62]. 

Despite the interest in the link between population 
density and influenza morbidity and mortality, little is 
known about critical turning points or population dens- 
ity thresholds above which the demographic cost of the 
pandemic may have exceeded that of low-population 
density areas. Previous studies on the 1918 pandemic 
have emphasized the size of populations, be they in 
cities, towns, or rural areas, by selecting arbitrary 



thresholds on the basis of jurisdictional sizes or 
reporting boundaries. In some instances the choice of 
threshold appears to have been made on the basis of 
convenience (i.e., using categories that may have been 
originally created by the producers of the data using 
some criterion other than epidemiology (see Table 1)). 
Therefore, there is a hitherto unfulfilled need for a delib- 
erate exercise to determine thresholds that focus on the 
epidemiologic phenomenon at hand. The aim of this 
paper is to apply a threshold estimation method to iden- 
tify a population density threshold separating high- 
density districts in British India from low-density 
districts. In estimating this threshold, we simultaneously 
test the hypothesis that the low-density districts so iden- 
tified differed from high-density districts in terms of 
population loss. As the world becomes increasingly 
urbanized, knowledge of how to estimate critical levels 
of population density above which populations may be 
at graver risk of contracting or succumbing to influenza 
than populations in lower density areas can play an im- 
portant role in fostering pandemic preparedness. The 
findings of our study contribute directly to the field of 
spatial epidemiology, which is concerned with "the study 
of spatial variation in disease risk or incidence" to assist 
public health decision making [63, p.328; 64]. 

Results and discussion 

Estimates from the initial set of models, described below, 
indicated multiple possible thresholds for population 
density. The point estimate of the threshold in model 
(1), described in the methods section below, was 19,067 
PPSM (people per square mile), for which only one dis- 
trict, Calcutta, lay above the threshold (see the table in 
the Appendix). The results from this model suggest that 
Calcutta, the most densely populated district in British 
India, with a population density of 35,025 PPSM, is an 
outlier. Therefore, Calcutta was removed from the 
dataset and the subsequent analyses were conducted 



Table 1 Sample of studies using population size or density thresholds 



Study 


Area 


Time period 


Unit of analysis 


Thresholds used 


Reasons for 




studied 


studied 






selecting threshold 


McSweeney 


New Zealand 


October 1 7 - 


Entire country was divided 


Cities (> 20,000) 


Not specified 


et al. [65] 




December 27, 1918 


into cities, large towns, 












small towns, and counties 






Garrett [55] 


United States 


1918 - 1919 


Individual states and 49 cities 


Cities (> 100,000) 


Data availability 


Kolte et al. [66] 


Denmark 


1917 - 1921 


22 counties, each divided 


Capitals, provincial towns, 


Based on weekly reports sent 








into countryside and towns 


rural areas 


to county health officials 


Nishiura and 


Japan 




Kanagawa, 199 regions 


Cities (> 20,000); 


McSweeney et al. [65] and 5,000 


Chowell [60] 








Large towns (5,000 - 20,000); 


minimum population size for 










Small towns (2,000 - 5,000); 


town status in Japan 










Villages (< 2,000) 




Chowell [13] 


England and 


June 29, 1918- 


305 administrative units, 


Cities, towns, and rural areas 


Urbanization not defined 




Wales 


May 10, 1919 


62 counties 
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Table 2 Threshold models for influenza population loss 



Coefficient estimate 



Model specification: District-specific intercepts and growth rates 



Threshold point estimate 



Alternate threshold 



Time trend ((3 10 ) 
Flu dummy ((3 2 o) 

Low density * flu dummy ((3 30 ) 

Time trend * flu dummy ((3 40 ) 
Number of obs. 

R 2 

Threshold population density 
Range of possible thresholds (5% level of significance) 
Number (percentage) of districts outside threshold range 
Population loss as % of population, low density districts 
Population loss as % of population, high density districts 



-0.2965*** 
(0.0126) 

0.0694*** 
(0.0246) 
t 

1188 
0.9964 



KEY DEMOGRAPHIC PHENOMENA 



175 
148-209 

-3.72% 
-4.69% 



110 (56%) 



-0.3144*** 
(0.0172) 

0.0596*** 
(0.0221) 
t 

1188 
0.9964 

435 
381-464 

-3.51% 
-5.85% 



Standard errors in parentheses. 
*** p < 0.01 . 

tMultiple estimates, one corresponding to each district. 



using a dataset of 198 districts and 1,188 observations. 
The results for datasets containing the Calcutta outlier 
in the Appendix are broadly consistent with the results 
presented in the paper. 

Table 2 contains the parameter estimates for the 
models without the Calcutta outlier. These models 
indicated the presence of two discrete intervals of pos- 
sible threshold values. Therefore, we present two models 
(columns 1 and 2) corresponding to the threshold value 



of population density that minimized the threshold test 
statistic in each interval (Figure 1). Figure 1 and Column 
1 in Table 2 shows the results of the threshold estima- 
tion procedure from model (%). The point estimate of 
the threshold value of population density was 175 PPSM. 
At the 5% level of significance, a discontinuous set of 
threshold values that could not be rejected was obtained, 
corresponding to the intervals 148-209 and 381-464 
PPSM (see Figure 1). These ranges are analogous to a 
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Critical value for threshold test statistic, 
-5% level of significance 



Alternate threshold estimate: 
• 435 people per square mile 



Threshold point estimate: 
175 people per square mile 



3,000 

Population Density (Log Scale) 



30,000 



Figure 1 Threshold test statistic: district-specific intercepts and growth rates (Calcutta outlier dropped). 
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95% confidence interval. The alternate threshold 
estimates, presented in Column 2 in Table 2, represent 
values of the threshold that (a) cannot be rejected at the 
5% level of significance and (b) yield the minimum 
threshold test statistic over the alternate range of con- 
tiguous possible (in the sense that they cannot be 
rejected) threshold values within which they occur. In 
this case, the point estimate is 435 PPSM, which 
produced the lowest test statistic for the 381-464 PPSM 
interval (see Figure 1). 

An important characteristic of the threshold estima- 
tion procedure is the ability to simultaneously test for 
differences in population loss between below-threshold 
and above-threshold districts. Interestingly, the diffe- 
rence between low-density and high-density districts is 
large. For the 175 PPSM threshold, below-threshold 
districts experienced a population growth rate of -3.72% 
between 1918 and 1919, while above-threshold districts 
experienced a growth rate of -4.69%, for a net difference 
of 0.97%. This difference is, moreover, statistically sig- 
nificant, as denoted by the significance of the parameter 
estimate for (3 30 . For the 435 PPSM threshold, the 
corresponding figures are -3.51%, -5.85%, and 2.31% 
respectively. 

In the above models, district-specific intercepts and 
coefficients on the time trend were also estimated. The 
intercepts correspond to the logarithm of district-specific 
population in 1891 and the coefficients on the time trend 
correspond to the annual rate of population growth. 



Because of the large number of estimates (198 each), these 
are not reported in Table 2. 

Figures 2 and 3 are a spatial illustration of the rela- 
tionship between population density and influenza- 
attributable population loss. Figure 2 is a map of India 
that contrasts the districts that are above and below the 
estimated population density threshold. Figure 3 is a 
map of the districts arranged by quintile of population 
loss using estimates computed in Chandra et al. [7]. In 
general, the coastal areas and Gangetic plain of India 
show coinciding areas of above-threshold population 
density and high rates of population loss. 

Conclusions 

The results of this study suggest the presence of popula- 
tion density thresholds that can be used to separate low 
population loss districts from high population loss 
districts in British India during the influenza pandemic of 
1918. Using Hansen's [67] method of threshold estimation, 
we identified a threshold of 175 PPSM. Below this thresh- 
old, districts experienced a decline in population of 3.72%, 
compared with a 4.69% decline for districts above the 
threshold, indicating a role for population density in 
understanding population loss from the epidemic. This 
evidence is significant in light of the often mixed findings 
of scholars on the relationship between population density 
and influenza mortality. The results from the other 
models presented in this paper suggest even greater 




Legend 

# Major Urban Center 

Population Density 
Population per Square Mile 

| Above 175 

| Below 175 
| Princely State/No Data 



Figure 2 Population density threshold in India, 1918-1919. 
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differences in population loss between low- and high- 
density districts, so the chosen point estimates and 
corresponding difference in this paper should be 
considered conservative estimates. 

Limitations 

While this study advances the literature in a number of 
ways, it has a number of limitations. The data do not 
contain information about cases of influenza or mortal- 
ity that resulted directly from influenza. Therefore, the 
types of available data preclude analysis of case- 
mortality, transmission (with or without mortality), and 
certain other phenomena of epidemiologic significance. 
It is also not possible to ascribe the entire population 
loss from the influenza pandemic to mortality alone be- 
cause of likely depressing effects of the disease on fer- 
tility in the immediate aftermath of the pandemic. Even 
though many studies on the 1918-1919 influenza in 
New Zealand, the USA and European countries have 
attempted to factor fertility into either post-pandemic 
growth rates or depression of conception during the 
pandemic [68-72], in the context of India, Davis [8] 
estimated underreporting of births [73] by as much as 
50% of the true figures. While likely minimal, migration 
effects also cannot be accounted for due to the lack of 
appropriate data. In light of these limitations, we have 
followed the approach of Davis [8], using changes in 
population size between 1918 and 1919 that are not 



explained by the normal population growth trajectory, 
obtained from relatively accurate population census 
data, to create a picture of severity of the pandemic in 
the different districts of India. This paper takes the 
position that the estimated population loss is an indi- 
cator of the severity of the pandemic as a whole, be 
that loss the direct result of influenza or of other 
conditions resulting from influenza, including pneu- 
monia, decreased fertility as a consequence of influ- 
enza or starvation due to pandemic-influenced famine. 
The data also do not contain information about 
socioeconomic status (i.e., poverty, social class or in- 
come, or ethnicity) [5,25,55,74,75] or remoteness [76], 
precluding the inclusion of other potential threshold 
variables that have been shown to be associated with 
mortality outcomes in other contexts. 

The last data limitation described above also raises the 
issue of the constraints that the methodology places on 
the analysis. Thresholds may have temporal and spatial 
variability depending on the above variables and popula- 
tion structure, viral virulence, or transmissibility, though 
there appears to be little by way of theory or evidence on 
this subject. In addition, the threshold estimation method 
used, which has been developed relatively recently, is 
designed to detect a single threshold based on a single 
threshold variable (in this case, population density). A 
more versatile version of the method which allows for 
multiple thresholds estimated by simultaneously using 
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multiple threshold variables would have been ideal. Fi- 
nally, to the extent that the conditions in British India were 
different from conditions encountered in many developed 
and developing countries today, the results need to be 
interpreted in their specific context. 

Strengths 

The above limitations do not detract conceptually from 
the utility of the sample-splitting methodology for the 
identification of demographic or other thresholds for 
health and potentially other phenomena, and for its value 
in establishing the critical role for population density in 
separating high-loss districts from districts that did not 
suffer as much during the single-worst epidemic of the 
20 th century in the single-worst-affected country. In this 
light, this study may be viewed as a prototype on which 
health geographers can build using the more sophisticated 
data that are available in modern contexts and as the 
sample-splitting methodology evolves. In addition, the 
data used in this study cover a larger area and population 
than any other single-country study of the pandemic, and 
thus have the strength of a large sample size. 

More generally, the findings presented above have a 
number of implications for researchers and policy makers 
in demography, epidemiology, planning, and public health. 
Most importantly, they introduce a new analytic method, 
threshold estimation, to the study of epidemics and their 
effects on populations and population growth. Subject to 
possible caveats about data and methodology, they also 
demonstrate that low population density districts in Brit- 
ish India may not have suffered as much as high popula- 
tion density districts from the influenza pandemic of 
1918-19. At a broader level, the mixed nature of results of 
studies of this question and their possible connection to 
contextual factors is an interesting one, and merits further 
study. The results of this study suggest that plans for pan- 
demic preparedness and adaptation can be informed by 
the results of studies using this method, especially where 
high-quality data are available. Threshold estimates can be 
used to inform the public about location-based risk in 
times of such epidemics where such risk is found to be 
present. In addition to introducing a new analytic tool to 
the study of the geography of health, therefore, it is hoped 
that this study will be used as a template to inform 
guidelines for pandemic preparedness issued by public 
health agencies with a view to minimizing the impact of 
such events in the future. 

Methods 

Data sources 

Following earlier studies that used census data [7,17,77], 
the data used in this study were obtained from six 
decennial censuses held in India, for the years 1891, 



1901, 1911, 1921, 1931, and 1941 [78]. We focus on 
these censuses, and not the two censuses preceding 
1891, those of 1872 and 1881, on the basis of Davis' [8] 
diagnosis that the earlier censuses undercounted the 
population by over 1%. In addition, these two earlier 
censuses were conducted using methods that had signifi- 
cantly changed by 1891. 

Within these data, we focus on population figures for 
the districts that were directly ruled by the British India 
government. We do not use parallel data from the 
princely states of India, which were nominally under the 
control of local princes and kings. Data collection by 
administrative authorities was in some cases significantly 
different from the British India administration in cap- 
acity and function. The coverage of the dataset is broad, 
encompassing 199 districts for each of the censuses for a 
total sample size of 1,194 observations. In addition, to 
ensure comparability over time, in the 1941 census, the 
population statistics for each census were reported after 
having been adjusted to conform to the district boundar- 
ies as of the 1941 census [78], providing a convenient 
dataset on population that is comparable across all the 
censuses. The district areas used in the computation of 
population density are, therefore, based on the 1941 
boundaries. 



Methods 

In order to estimate population density thresholds that 
separate low- and high-density districts on the basis of 
population loss from the pandemic, we use the threshold 
estimation technique of Hansen [67] embedded in a 
population growth model as follows. First, we compute 
the mean population density of each district across the six 
censuses as the population of the district divided by the 
area of the district in square miles to yield persons per 
square mile (henceforth PPSM). This mean population 
density provides an estimate of population density at the 
onset of the pandemic, and is used as the ordering vari- 
able for the threshold estimation procedure. It is also 
highly correlated (r = 0.999) with an alternate estimate of 
population density computed using the estimate of popu- 
lation in 1918 computed from the standard exponential 
population growth model allowing for a break in 1918-19 
[7]. After ascertaining that the correlation between this 
mean population density and the dependent variable in 
the population growth model, namely the log of popula- 
tion, is not significantly different from zero (r = 0.036, 
P = 0.219), a condition for the threshold estimation pro- 
cedure, we estimate the population density threshold. The 
use of the mean density rather than census-specific density 
ensures that, during the implementation of the sample- 
splitting algorithm (see below), all six observations for 
each district lie on the same side of the threshold. 
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The model used for this exercise was a fixed effects 
model allowing for each district to have different inter- 
cept and time trend terms to absorb heterogeneity in 
(the log of) population size and population growth, and 
allowing for a drop in the population between 1918 and 
1919, the year of the pandemic. This broad approach 
was also employed in Davis' [8] classic study and 
developed elsewhere [7,17,77]. The general model, 
which follows the approach of the latter three studies, 
can be expressed as 

LPOP it = p oi + $ n T t + p 2 FLU t + $ 3i T t FLU t + e u 

where LPOP it is the log of population in district i in year 
t, T t is a time trend, FLU t is a year-specific indicator vari- 
able defined as 

FLU -I 1 * t<191 * 
tLUt -\0, *>1918 

8 it is a random error term, p 0 i> an d p3i, are 

vectors of district-specific parameters, and f> 2 is a 
(fixed) parameter. To this model, for each of the 
199 possible values of the threshold level of popula- 
tion density (corresponding to the 199 different 
districts in the sample), an indicator variable was 



added to the data such that the variable took on the 
value 1 if the observation was drawn from a district 
with a population density lower than the threshold 
and 0 otherwise. This indicator variable was 
interacted with the variable of interest, namely the 
term capturing the drop in population from the in- 
fluenza pandemic to produce model (1): 

LPOP it = Poi + Pii^ + $2 FLU t + PzhFLUt 

+ $ 4i T t FLU t + E it (1) 

Here, I D is the indicator variable corresponding to the 
threshold population density D [67]. For each of the 199 
possible values of D, the above equation was estimated 
using the 1,194 available observations. The point esti- 
mate of the threshold value of D was the one for which 
the sum of squared errors for the above model was 
minimized. As a robustness check of the above model, 
we also estimated models without district-level hetero- 
geneity in the coefficient estimates corresponding to 
population growth. Thus model (2) was 

LPOP it = Poi + h Tt + P 2 FLU t + P 3 Ir>FLU t (2) 
+ P 4 T t FLU t + ^ 



Appendix 



The Table 3 shows the threshold models for influenza population loss with Calcutta outlier. 
Table 3 Threshold models for influenza population loss with Calcutta outlier 


Model specification: District-specific intercepts and growth rates 


Threshold point estimate 


Alternate threshold 1 


Alternate threshold 2 


Time trend ((3 10 ) 


t 


t 


t 


Flu dummy ((3 2 o) 


-0.7676*** 


-0.2997*** 


-0.3201*** 




(0.1546) 


(0.0128) 


(0.0174) 


Low density * flu dummy ((3 30 ) 


0.4894*** 


0.0726*** 


0.0654*** 




(0.1550) 


(0.0250) 


(0.0224) 


Time trend * flu dummy ((3 40 ) 


t 


t 


t 


Number of obs. 


1194 


1194 


1194 


R 2 


0.9963 


0.9963 


0.9963 




KEY DEMOGRAPHIC PHENOMENA 




Threshold population density 


19067 


175 


435 


Range of possible thresholds (5% level of significance) 


1138-19067 


1 75-207 


430-464 


Number (percentage) of districts outside threshold range 




52 (26%) 




Population loss as % of population, low density districts 


-4.44% 


-3.72% 


-3.51% 


Population loss as % of population, high density districts 


-21.15% 


-4.81% 


-6.06% 



Standard errors in parentheses. 
*** p < 0.01 . 

tMultiple estimates, one corresponding to each district. 



Chandra et al. International Journal of Health Geographies 2013, 12:9 
http://www.ij-healthgeographics.eom/content/12/1/9 



Page 9 of 10 



Competing interests 

None of the authors has any competing interests. 
Authors' contributions 

SC: Project leadership, design and execution of quantitative analysis; writing 
of entire manuscript. EK-N: Project leadership, writing of background, results, 
conclusion, and reference sections. GK: Design and execution of quantitative 
analysis; writing of analytic section. JV: Preparation of maps. All authors read 
and approved the final manuscript. 

Acknowledgements 

The data used in this research were collected with support from the National 
Institute on Drug Abuse (NIDA, grant number 1 R21 DA02591 7-01 A1 (PI: 
Chandra)), National Institutes of Health. Its contents are solely the 
responsibility of the authors and do not necessarily represent the official 
views of NIDA. 

Author details 

1 Asian Studies Center, Michigan State University, 427 N Shaw Lane, Room 
301, East Lansing, Ml 48824, USA. 2 Urban and Transport Planning in the 
School of Planning, Design, and Construction and Global Urban Studies 
Program, 552 W Circle Drive, Room 201 E, East Lansing, Ml 48824, USA. 
department of Psychology, Psychology Building 316 Physics Room 262, East 
Lansing, Ml 48824, USA. 4 Department of Geography, 673 Auditorium Road, 
Room 116, East Lansing, Ml 48824, USA. 

Received: 15 October 2012 Accepted: 13 January 2013 
Published: 20 February 2013 

References 

1. National Institute of Allergy and Infectious Diseases: "Reconstructed 1918 
Influenza Virus Has Yielded Key Insights, Scientists Say.".; 2012. Retrieved 6 
October, 2012, from http://www.niaid.nih.gov/news/newsreleases/2012/ 
Pages/Reconstructed 1 91 8Flu.aspx. 

2. Hobday RAP, Cason JWP: The Open-Air Treatment of Pandemic Influenza. 
Am J Public Health 2009, 99(S2):S236-242. 

3. Lezzoni L: Influenza 1918: The Worst Epidemic in American History. New York, 
NY: TV Books; 1918:1999. 

4. U.S. Department of Health & Human Services: "Pandemic Flu History."; 201 2. 
Retrieved 2012, 7 October, from http://www.flu.gov/pandemic/history/index.html. 

5. Mamelund S-E: A socially neutral disease? Individual social class, 
household wealth and mortality from Spanish influenza in two socially 
contrasting parishes in Kristiania 1918-1919. Social Science & Medicine 

2006, 62(4):923-940. 

6. Johnson NPAS, Mueller J: Updating the accounts: global mortality of the 
1918-1920" Spanish" influenza pandemic. Bulletin of the History of Medicine 
2002, 76(1 ):1 05-1 15. 

7. Chandra S, Kuljanin G, Wray J: Mortality from the Influenza Pandemic of 
1918-1919: The Case of India. Demography 2012, 49(3):857-865. 

8. Davis K: The Population of India and Pakistan. Princeton, NJ: Princeton 
University Press; 1951. 

9. Santibahez S, Fiore A, Merlin T, Redd S: A Primer on Strategies for 
Prevention and Control of Seasonal and Pandemic Influenza. Am J Public 
Health 2009, 99(S2):S2 16-224. 

10. Brundage JF, Shanks GD: Deaths from bacterial pneumonia during 1918- 
19 influenza pandemic. Emerg Infect Dis 2008, 14(8):1 193-1 199. 

11. Brundage JF, Shanks GD: What really happened during the 1918 influenza 
pandemic? The importance of bacterial secondary infections. J Infect Dis 

2007, 196(1 1):1717-1718. 

1 2. Curson P, McCracken K: An Australian perspective of the 1 91 8-1 91 9 influenza 
pandemic. New South Wales Public Health Bulletin 2006, 1 7(8):1 03-1 07. 

13. Chowell G, Luis MAB, Niall J, Wladimir JA, Cede V: The 1918-1919 
influenza pandemic in England and Wales: spatial patterns in 
transmissibility and mortality impact. Proceedings of the Royal Society B 

2008, 275(1 634):501 -509. 

14. Langford C: The age pattern of mortality in the 1918-19 influenza 
pandemic: an attempted explanation based on data for England and 
Wales. Medical history 2002, 46(1):1-14. 

15. Patterson KD, Pyle GF: The geography and mortality of the 1918 influenza 
pandemic. Bulletin of the History of Medicine 1991, 65(1 ):4-21. 



1 6. Sanitary Commissioner: Annual report of the Sanitary Commissioner with the 
Government of India. Calcutta: Superintendent of Government Printing; 1920. 

17. Chandra S: Mortality from the influenza pandemic of 1918-19 in 
Indonesia. Popul Stud (Camb) 2013, [Epub ahead of print]. 

18. Burgerlijk Geneeskundigen Dienst: Reports of the Netherlands-Indian Medical 
Civil Service. Mededeelingen van den Burgerlijken Geneeskundigen dienst in 
Nederlandsch-lndie#776; . Batavia: Landsdrukkerij; 1912:1912-1925. 

19. Johnson N: Britain and the 1918-1919 Influenza Pandemic: A Dark Epilogue. 
Oxford: Routledge; 2006. 

20. Erkoreka A: The Spanish influenza pandemic in occidental Europe 

(1 91 8-1 920) and victim age. Influenza and other respiratory viruses 201 0, 
4(2)51-89. 

21. Chowell G, Viboud C, Simonsen L, Miller MA, Acuna-Soto R: Mortality 
patterns associated with the 1918 influenza pandemic in Mexico: 
evidence for a spring herald wave and lack of preexisting immunity in 
older populations. J Infect Dis 2010, 202(4):567-575. 

22. Chowell G, Viboud C, Simonsen L, Miller MA, Hurtado J, Soto G, Vargas R, 
Guzman M, Ulloa M, Munayco C: The 1918-1920 influenza pandemic in 
Peru. Vaccine 201 1, 29:B21-B26. 

23. Olson DR, Simonsen L, Edelson PJ, Morse SS: Epidemiological evidence of 
an early wave of the 1918 influenza pandemic in New York City. Proc 
Natl Acad Sci USA 2005, 1 02(31 ):1 1059-1 1063. 

24. Andreasen V, Viboud C, Simonsen L: Epidemiologic characterization of the 
1918 influenza pandemic summer wave in Copenhagen: implications for 
pandemic control strategies. J Infect Dis 2008, 197(2):270-278. 

25. Mamelund SE: Spanish influenza mortality of ethnic minorities in Norway 
1918-1919. European Journal of Population/Revue europeenne de 
Demographie 2003, 1 9(1 ):83-1 02. 

26. Barry JM, Viboud C, Simonsen L: Cross-protection between successive waves 
of the 1918-1919 influenza pandemic: epidemiological evidence from US 
Army camps and from Britain. J Infect Dis 2008, 1 98(1 0):1 427-1 434. 

27. Mathews JD, McBryde ES, McVernon J, Pallaghy PK, McCaw JM: Prior 
immunity helps to explain wave-like behaviour of pandemic influenza in 
1918-9. BMC Infect Dis 2010, 10(128):1-9. 

28. Crosby AW: America's forgotten pandemic: the influenza of 1918. Cambridge: 
Cambridge University Press; 2003. 

29. Pyle GF: The diffusion of influenza: patterns and paradigms. Totowa, NJ: 
Rowman & Littlefield Pub Incorporated; 1986. 

30. Barry J: The Great Influenza: The Epic Story of the Deadliest Plague in History. 
New York: Penguin Group; 2004. 

31. Epstein P: Climate Change and Infectious Disease: Stormy Weather 
Ahead? Epidemiology 2002, 13(4)373-375. 

32. Dawood F-S, luliano A-D, Reed C, Meltzer M-l, Shay D-K, Cheng P-Y, 
Bandaranayake D, Breiman R-F, Brooks W-A, Buchy P, et al: Estimated global 
mortality associated with the first 12 months of 2009 pandemic 
influenza A H1N1 virus circulation: a modelling study. Lancet Infect Dis 
2012, 12(9):687-95. 

33. Bootsma MCJ, Ferguson NM: The effect of public health measures on 
the 1918 influenza pandemic in US cities. Proc Natl Acad Sci 2007, 
104(18):7588-7593. 

34. Valsecchi M: Mass Plague Graves Found On Venice 'Quarantine' Island. 
Washington, DC: National Geographic News; 2007. 

35. Seitz S, Miller S: The Other Islands of New York City. Woodstock, VT: 
Countryman Press; 2003. 

36. Sussna S: Bulk control and zoning :the New York city experience. Land 
economics 1 967, 43(2):1 58-1 71 . 

37. Alexandrov BB, Hardman J, Janies DA, Studer J, Treseder TW, Valson C, 
Voronkin IO: Selection for resistance to oseltamivir in seasonal and 
pandemic H1N1 influenza and widespread co-circulation of the lineages. 
IntJHeal Geogr 2010, 9:13. 

38. Lai PC, Wong CM, Hedley AJ, Lo SV, Leung PY, Kong J, Leung GM: 
Understanding the Spatial Clustering of Severe Acute Respiratory Syndrome 
(SARS) in Hong Kong. Environ Heal Perspect 2004, 1 1 2(1 5):1 550-1 556. 

39. Peterson AT, Williams RAJ: Ecology and geography of avian influenza 
(HPAI H5N1) transmission in the Middle East and northeastern Africa. Int 
J Heal Geogr 2009, 8:47. 

40. Anderson RM, May RM: Infectious Diseases of Humans: Dynamics and Control. 
Oxford: Oxford Univ. Press; 1992. 

41. Ferguson NM, Cummings DAT, Cauchemez S, Fraser C, et al: Strategies for 
containing an emerging influenza pandemic in Southeast Asia. Nature 
2005, 437(7056):209-214. 



Chandra et al. International Journal of Health Geographies 2013, 12:9 
http://www.ij-healthgeographics.eom/content/12/1/9 



42. Ferguson NM, Cummings DAT, Fraser C, Cajka JC, Cooley PC, Burke DS: 
Strategies for mitigating an influenza pandemic. Nature 2006, 
442(7101):448-452. 

43. Eubank S, Guclu H, Anil Kumar VS, Marathe MV, et al: Modelling disease 
outbreaks in realistic urban social networks. Nature 2004, 429(6988):180-184. 

44. Germann TC, Kadau K, Longini IM Jr, Macken CA: Mitigation strategies for 
pandemic influenza in the United States. Proc Natl Acad Sci USA 2006, 
103(15):5935-5940. 

45. Glass RJ, Glass LM, Beyeler WE, Min HJ: Targeted social distancing design 
for pandemic influenza. Emerg Infect Dis 2006, 12:1671-1681. 

46. Halloran ME, Ferguson NM, Eubank S, Longini IM Jr, Cummings DAT, Lewis 
B, Xu S, Fraser C, Vullikanti A, Germann TC, et al: Modeling targeted layered 
containment of an influenza pandemic in the United States. Proc Natl 
Acad Sci USA 2008, 1 05(1 2):4639. 

47. Atkinson M, Wein L: Quantifying the Routes of Transmission for Pandemic 
Influenza. Bull Math Biol 2008, 70:820-867. 

48. Wein LM, Atkinson MP: Assessing Infection Control Measures for 
Pandemic Influenza. Risk Analysis 2009, 29(7):949-962. 

49. Cliff AD, Haggett P: Spatial aspects of epidemic control. Prog Hum Geogr 
1989, 13(3)315-347. 

50. Bensimon CM, Upshur REG: Evidence and Effectiveness in Decisionmaking 
for Quarantine. Am J Public Health 2007, 97(1):44-48. 

51. Frenkel S, Western J: Pretext or prophylaxis? Racial segregation and 
malarial mosquitos in a British tropical colony: Sierra Leone. Ann Assoc 
Am Geogr 1988, 78(2):21 1-228. 

52. Curtin PD: Medical knowledge and urban planning in tropical Africa. Am 
Hist Rev]985, 90(3)594-613. 

53. McLafferty S: Placing pandemics: geographical dimensions of vulnerability 
and spread. Eurasian Geography and Economics 2010, 51 (2):1 43—1 61 . 

54. Margesson R: Haiti Earthquake: Crisis and Response. Congressional 
Research Service 2010, 7-5700(R41023):1-58. 

55. Garrett TA: Economic Effects of the 1918 Influenza Pandemic: 
Implications for a Modern-day Pandemic. In Working paper CA0721. 
2010th edition.: Federal Reserve Bank of St. Louis Web site; 2007. 

56. Ohadike DC: Diffusion and Physiological Responses to the Influenza 
Pandemic of 1918-19 in Nigeria. Social Science & Medicine 1991, 
32(1 2):1 393-1 399. 

57. Kao C-L, Chan T-C, Tsai C-H, Chu K-Y, Chuang S-F, Lee C-C, Li Z-RT, Wu K-W, 
Chang L-Y, Shen Y-H, et al: Emerged HA and NA Mutants of the Pandemic 
Influenza H1N1 Viruses with Increasing Epidemiological Significance in 
Taipei and Kaohsiung, Taiwan, 2009-10. PLoS One 2012, 7:e31 162. 

58. Spicknall IH, Koopman JS, Nicas M, Pujol JM, Li S, Eisenberg JNS: Informing 
Optimal Environmental Influenza Interventions: How the Host, Agent, 
and Environment Alter Dominant Routes of Transmission. PLoS 
Computational Biology 201 0, 6(1 0):1 -11. 

59. Mills CE, Robins JM, Lipsitch M: Transmissibility of 1918 pandemic 
influenza. Nature 2004, 432(701 9):904-906. 

60. Nishiura H, Chowell G: Rurality and pandemic influenza: geographic 
heterogeneity in the risks of infection and death in Kanagawa, Japan 
(1 91 8-1 91 9). New Zealand Medical Journal 2008, 121(1 284):6-1 0. 

61 . Gani SR, Ali ST, Kadi AS: he transmission dynamics of pandemic influenza A/ 
H1N1 2009-2010 in India. Current Science (Bangalore) 201 1, 101(8)4065-1072. 

62. Kumar S, Quinn SC: Existing health inequalities in India: informing 
preparedness planning for an influenza pandemic. Health Policy and 
Planning 2012, 27(6):516. 

63. Ostfeld RS, Glass GE, Keesing F: Spatial epidemiology: an emerging (or re- 
emerging) discipline. Trends Ecol Evol 2005, 20:328-336. 

64. Linard C, Tatem AJ: Large-scale spatial population databases in infectious 
disease research. Int J Heal Geogr 201 2, 11 :7. 

65. McSweeny K, Colman A, Fancourt N, Parnell M, Stantiall S, Rice G, Baker M, 
Wilson N: Was rurality protective in the 1918 influenza pandemic in New 
Zealand? New Zealand Medical Journal 2007, 1 20(1 256):2579. 

66. Kolte V, Skinho P, Keiding N, Lynge E: The Spanish flu in Denmark. Scand J 
Infect Dis 2008, 40:538-645. 

67. Hansen BE: Sample Splitting and Threshold Estimation. Econometrica 
2000, 68(3):575-603. 

68. Mamelund SE: Can the Spanish Influenza Pandemic of 1918 Explain the 
Baby Boom of 1920 in Neutral Norway? Popul Engl Ed 2004, 59(2):229-260. 

69. Pool D: The effects of the 1918 pandemic of influenza on the Maori 
population of New Zealand. Bulletin of the History of Medicine 1973, 47(3):273. 



Page 10 of 10 



70. Bloom-Feshbach K, Simonsen L, Viboud C, Molbak K, Miller MA, 
Gottfredsson M, Andreasen V: Natality decline and miscarriages associated 
with the 1918 influenza pandemic: the Scandinavian and United States 
experiences. J Infect Dis 201 1 , 204(8): 1 1 57-1 1 64. 

71. Bloom-Feshbach K, Simonsen L, Viboud C, Molbak K, Miller MA, 
Gottfredsson M, Andreasen V: Reply to Mamelund. J Infect Dis 2012, 
206(1)441-143. 

72. Mamelund SE: Fertility fluctuations in times of war and pandemic 
influenza. J Infect Dis 2012, 206(1 ):1 40-1 41. 

73. Mills ID: The 1918-1919 Influenza Pandemic - The Indian Experience. The 

Indian Economic and Social History Review 1986, XXIII(1):1-40. 

74. Murray CJL, Lopez AD, Chin B, Feehan D, Hill KH: Estimation of potential 
global pandemic influenza mortality on the basis of vital registry data 
from the 1918-20 pandemic: a quantitative analysis. Lancet 2006, 
368(9554):221 1-2218. 

75. Mamelund SE: An egalitarian disease? Socioeconomic status and individual 
survival of the Spanish Influenza pandemic of 1918-19 in the Norwegian 
capital of Kristiania. Oslo: Memorandum, Department of Economics, 
University of Oslo; 2004. 

76. Mamelund SE: Geography may explain adult mortality from the 1918-20 
influenza pandemic. Epidemics 201 1, 3(1)46-60. 

77. Chandra S: Deaths Associated with Influenza Pandemic of 
1918-19, Japan. Emerg Infect Dis 2013, 19(4). In press. 
http://wwwnc.cdc.gOv/eid/article/1 9/4/1 2-01 03_article.htm. 

78. Census of India: Census of India. Calcutta: Superintendent: Government 
Printing; Various years. 



doi:1 0.1 1 86/1 476-072X-1 2-9 

Cite this article as: Chandra et al.: A geographic analysis of population 
density thresholds in the influenza pandemic of 1918-19. International 
Journal of Health Geographies 2013 12:9. 



Submit your next manuscript to BioMed Central 
and take full advantage of: 

• Convenient online submission 

• Thorough peer review 

• No space constraints or color figure charges 

• Immediate publication on acceptance 

• Inclusion in PubMed, CAS, Scopus and Google Scholar 

• Research which is freely available for redistribution 



Submit your manuscript at f~\ RiftMM i rpntral 

www.biomedcentral.com/submit momea central 



